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Abstract 

We use a simplified version of the halo model with a power law power spectrum to 
study scale dependence in galaxy bias at the very large scales relevant to baryon 
oscillations. In addition to providing a useful pedagogical explanation of the scale 
dependence of galaxy bias, the model provides an analytic tool for studying how 
changes in the Halo Occupation Distribution (HOD) impact the scale dependence 
of galaxy bias on scales 10 < < 1000/i _1 Mpc, which is useful for interpreting 
the results of complex N-body simulations. We find that changing the mean number 
of galaxies per halo of a given mass will change the scale dependence of the bias, 
but that changing the way the galaxies are distributed within the halo has a smaller 
effect on the scale dependence of bias at large scales. We use the model to explain 
the decay in amplitude of the baryon oscillations as k increases, and generalize the 
model to make predictions about scale dependent galaxy bias when redshift space 
distortions are introduced. 



1 Introduction 

The coupling of photons and baryons by Thompson scattering in the early 
universe results in gravity driven acoustic oscillations of the photon-baryon 
fluid. The features that appear in both the Cosmic Microwave Background 
(CMB) anisotropies and matter power spectra are snapshots of the phases of 
these oscillations at the time of decoupling, and provide important clues used 
to constrain a host of cosmological parameters. Features in the matter power 
spectrum, referred to as baryon (acoustic) oscillations, have the potential to 
strongly constrain the expansion history of the universe and the nature of the 
dark energy. These features in the matter power spectrum induce correlations 
in the large-scale clustering of the IGM, clusters or galaxies. Indeed a large, 
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high redshift, spectroscopic galaxy survey encompassing a million objects has 
been proposed (1) as a dark energy probe, building on the successful detection 
(at low redshift) of the features in the clustering of the luminous red galaxy 
sample of the Sloan Digital Sky Survey (2). Key to this program is the means 
to localize the primordial features in the galaxy power spectra, which nec- 
essarily involves theoretical understanding of such complex issues as galaxy 
bias, non-linear structure evolution, and redshift space distortions. These ef- 
fects can shift the observed scales of peaks and troughs in observations of the 
baryon oscillations, affecting the transverse and radial measurements differ- 
ently. Marginalizing over this uncertain scale dependence can be a large piece 
of the error budget of proposed surveys. 

Inroads into modeling galaxy bias, non-linear evolution, and redshift space dis- 
tortions have been made by several groups using N-body simulations (3), but 
these simulations are complex and often mask the underlying physical mech- 
anisms. A route to understanding the sophisticated simulations is provided 
by an analytic tool known as the halo model (4). The halo model makes the 
assumption that all matter in the universe lives in virialized halos, and that 
the power spectrum of this matter can be divided up into two distinct con- 
tributions; the 1-halo term arising from the correlation of matter with other 
matter existing in the same halo, and the 2-halo term quantifying the corre- 
lation of matter with matter that lives in a different halo. The halo model is 
extended to calculate galaxy power spectra through the introduction of a Halo 
Occupation Distribution (HOD), that describes the number of galaxies that 
exist in a halo of a given mass, and their distribution inside that halo. The 
HOD will naturally depend on the details of galaxy formation, but is not in 
itself a theory of galaxy formation. Rather it is a description of the effects of 
galaxy formation on the number and distribution of the galaxies. The impact 
of galaxy formation on cosmological observables such as the baryon oscilla- 
tion can be studied in the halo model by investigating the sensitivity of the 
observable to changes made to the HOD. 

In this paper, we employ a simple version of the analytic halo model to study 
the origins of scale dependence in galaxy bias. We investigate the impact of 
changing the HOD on the observed spectra, and show that the scale depen- 
dence of the bias arises in a natural way from extending the dark matter 
description to a description of galaxies. Specifically, in generalizing the de- 
scription to an ensemble of rare tracers of the dark matter, the 1-halo and 
2-halo terms in the power spectrum are shifted by different amounts. The 
scale dependence in the galaxy bias arises from this difference. We find that 
for small k, the galaxy power spectrum is sensitive to the number of galaxies 
that occupy a halo, but not to their positions within the halo. We quantify 
the impact of redshift space distortions, which find a natural description in 
the halo model, and discuss the implications for large galaxy redshift surveys. 
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2 The Halo Model 



We will try to understand the scale-dependence of the galaxy bias by exam- 
ining a simple model. Our goal is not to make precise predictions, but rather 
to use simple analytic approximations to help interpret the results of N-body 
simulations, which are much more appropriate for studying the fine details. 

Our investigation makes use of the halo model (4), which assumes that all of 
the mass and galaxies in the universe live in virialized halos, whose clustering 
and number density is characterized by their mass. This model can be used 
to approximate the two point correlation function of the mass, and of various 
biased traces of the mass, such as luminous galaxies. In this framework, there 
are two contributions to the power spectrum: one arises from the correlation 
of objects that reside in the same halo (the 1-halo term), and the other comes 
from the correlation of objects that live in separate halos (the 2-halo term). 
For the dark matter, for example, the dimensionless power spectrum can be 
written 

2 _ k 3 P dm (k) _ 2 2 

m = 271^ ~~ lh dm + 2h^dm l-U 
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with M the virial mass of the halo, p the mean background density, rih{M) 
the number density of halos of a given virial mass, and bh(M, k) the halo bias 
(5). The function y(M,k) is the Fourier transform of the halo profile which 
describes how the dark matter is spatially distributed within the halo. 

Expressed this way, Eqs. (2) and (3) lend themselves to a fairly intuitive 
interpretation. In the two halo term, the dark matter being correlated lives in 
widely separated halos, of different masses. For each of the two halos, the mass 
is multiplied by the function y that governs its spatial distribution within a 
halo, weighted by the number density of halos rih with bias bh, and integrated 
over all possible halo masses. The one halo term is even simpler - correlating 
two bits of dark matter residing in the same halo. Thus there are two factors 
of M x y weighted with the number density of halos iih, and integrated over 
all halo masses. 

We can generalize this framework to compute the 1- and 2-halo terms for a 
galaxy population that traces the density field (4). We divide the galaxies into 



3 



two sub-populations, centrals and satellites. The centrals will reside at the 
center of the host halo, while the satellites will trace the dark matter. The 
Halo Occupation Distribution (HOD) sets the number of tracers in a halo of 
mass M. We assume there is either a central galaxy or not, and the number 
of satellites is Poisson distributed (6). For our model we will use 



(iV c ) = 0(M-M min ) 
(N s ) = 0(M - M min ) 



M 



sat 



(4) 
(5) 



where O is the Heaviside function, and M min < M sat . Note that the central 
galaxies do not trace the halo profile, and are not weighted by y. The gener- 
alization of the 1-halo and 2-halo terms is given by (4) 
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is the number density of galaxies. The interpretation of these expressions is 
similar to that of the dark matter. 



For the purposes of this toy model, we shall adopt a power law for the linear 
power spectrum, 




K 



3+n 



(9) 



and define a dimensionless wavenumber k = kjk*. In order to simplify many 
of the expressions in the calculation, we find it useful to change variables from 
M to a dimensionless quantity, u, related to the peak height of the overdensity. 
For the power law model considered here v is a simple function of the mass: 



u(M) 
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Here, 8 C = 1.686 and a(M) is the linear theory variance in top hat spheres 
of radius R = (3M/47rp) 1//3 . We have introduced a a dimensionless mass m in 
terms of the scale mass M*, the mass for which a{M ir ) = 5 C and v — 1. Note 
M* is a function of the power spectrum normalization /c* and the index n, and 
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can be computed using the relation 



3ji(fcfl) 
kR 

where j\ is the spherical Bessel function of order 1. 
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The mass function rih(M) takes a simple form when expressed in terms of the 
multiplicity function, f(u). The multiplicity function is a normalized number 
density of halos at a given mass: 



M r 
f(v)dv = —n h {M)dM with f(v)dv=l . (12) 

P J 



For the Press-Schechter (P-S) mass function (7) 

e -v/2 

m = -z= (13) 
y2itv 

and the halos form biased tracers of the density field. On large scales, for small 
fluctuations, the bias is (5) 

h(") = 1 + ^ (14) 

which satisfies / dvf{y)b{y) = 1. In detail of course the halos do not provide a 
linearly biased tracer of the linear density field. On smaller scales both higher 
order terms and halo exclusion effects give rise to a scale-dependence. Both 
analytic calculations (5) and simulations (3) suggest that this is a few percent 
correction on the scales of interest to us and we shall henceforth neglect it. 

When looking at large scales, such as those relevant to the baryon wiggles in 
the linear power spectrum, the function y(M, k) can be accurately approxi- 
mated by a Taylor expansion into powers of kr v , where r v is the virial radius, 
which depends upon the mass. Assuming an NFW form (10) for y(M, k) and 
expressing the mass dependence explicitly, the expression is 

y(M, k) = 1 + c 2 {kr v f + c 4 (kr v ) 4 + ■■■ 

= 1 + c 2 (hn) 2 K 2 m 2 / 3 + c^r^Vm 4 / 3 + • • • (15) 

Here we have introduced another quantity, the virial radius of an M* halo, 
= r v (M+) = (3M Vr /47rAp) 1 / 3 , where A is the virialization overdensity, which 
we will take to be A = 200. The expansion coefficients c 2 and c 4 are functions of 
the halo concentration, and for the NFW model are ratios of gamma functions. 
For cluster sized halos we expect c ~ 5 which leads to c 2 — —0.049 and 
c 4 ~ 0.0014, while for galaxies c ~ 10 making c 2 — —0.04 and c 4 ~ 0.0011. 
The quantity k±r\ can be computed using the relation in Eq. (11), and turns 
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Table 1 



Expressions and values for k±r+ and A* in terms of 8 C = 1.686 and the virializa- 
tion overdensity A = 200. Here A;* is the normalization of the dark matter power 
spectrum, and r* = r v (M±) is the virial radius of a halo of mass M+. The factor 
A* = k+M+/(27r 2 p) relates the amplitude of the 1- and 2-halo terms and 7d m is 
defined in Eq. (18). 

out to be a function only of the index n. We have tabulated the expressions 
and values of k*r+ in Table 1. On large scales, where k < k+, we see that the 
coefficients of the last two terms in the expression for y(M, k) are extremely 
small. We have repeated our analysis neglecting these terms and find that 
these are insignificant corrections to the scale dependence we are studying. 
This is consistent with the results of (8), who found that the local relation 
between galaxies and mass within a halo does not significantly impact the 
large scale galaxy correlation function. For simplicity we shall set y(M, k) — 1 
for the remainder of the paper. 



3 Results real space 



Having argued that we can safely approximate y and bh as scale independent 
quantities when studying clustering at large scales our expressions simplify 
dramatically. The mass power spectrum is simply 3 

A 2 dm (k) = k 3 (K n + A* ldm ) K«l (16) 

where A+ and 7d m are n-dependent constants 

POO 

7dm = / m(u)f(u)du = 2 3 /( 3 +™) 7r - 1 / 2 T [1/2 + 3/(n + 3)] (18) 
Jo 

We list the values of A* and 7 dm for some values of n in Table 1. Referring 
to the Table we see that, for n near —1, the 1-halo term dominates only for 



3 If appropriate halo profiles, e.g. Gaussians, are chosen the full A;-dependent inte- 
grals can also be done in terms of special functions. 
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k > k+, outside of the range of relevance for us. 



The scale-dependent bias can be defined as 

BHk) = § (19) 

2h^dm +lh A dm 

which can be re-written to explicitly exhibit its scale dependence as 

M&K) 2 (l + (n- n + ---) (21) 

where a g , f3 g and 7 9 are dimensionless integrals of u, a g is the galaxy number 
density in dimensionless units, f3 g /a g is the galaxy weighted halo bias and 7 S 
counts' the number of galaxy pairs in a single halo 4 . We have neglected terms 
higher order in k. The term K~ n encodes the leading order scale dependence 
and is proportional to the inverse of the linear dark matter power spectrum. 
Choosing a = 1 in our HOD as a representative example the relevant integrals 
are 



a g 



m{v)- l f{v) [1 + m(z/)/m sat ] dv (22) 

- min 
roc 

g = / m{v)- l f{v) [1 + m(z/)/m sat ] b h {v) dv (23) 
7 S = / m{v)- x f{v) [2m(i/)/m sat + (m(z/)/m sat ) 2 J dv (24) 

and the factor governing the scale-dependence is 

C^min, "Isat, = A, (lg/Pn ~ 7dm) (25) 



Note ( depends on the number of pairs of galaxies divided by the square of 
the large-scale bias. If one wishes to reintroduce the halo profiles there is a 
simple modification to the integrals. In a g , f3 g , and j g , every occurrence of 
m(v)/m SSit will be multiplied by y(v,k). For the dark matter, 7 dm will have 
an extra factor of \y(v, k)\ 2 in the integrand, and the 1 in the denominator of 
Eq. (20) will be replaced by / b h (v)f(v)y(v, k)dv squared. 

In this form it is clear that the scale-dependent bias arises because the 1- 
and 2-halo terms for the galaxies are different multiples of their respective 
dark matter counterparts. Typically the 1-halo term is enhanced more than 
the 2-halo term, leading to an increase in the bias with decreasing scale. A 
cartoon of this is shown in Fig. 1. The relative enhancements of the 1 and 
2-halo terms depend on the HOD parameters for the galaxy population used 



4 In the limit y(v, k) = 1 we have a^m = Aim = 1- The expression for 7dm is given 
in Eq. (18). 
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Fig. 1. A cartoon illustrating the difference in the shift of the 1-halo and 2-halo 
terms (dashed lines) of the galaxy power spectra with respect to the dark matter. 
Because of this difference, the 1-halo term dominates on larger scales for the galaxy 
spectrum. This leads to a change in the ratio of the total power (solid curves), which 
leads to a scale dependent galaxy bias. 



as tracers. Note also that in our simple model the 2-halo term retains the 
oscillations of the linear theory spectrum, while in the 1-halo term they are 
absent. This provides a partial 5 explanation of the reduction of the contrast 
of the oscillations with increasing k. 



Figure 2 shows £ vs. u min for several different values of the other HOD param- 
eter M sat . We see that the scale dependence is more prominent as the number 
of satellite galaxies is increased, and that a higher threshold halo mass for con- 
taining a central galaxy leads to a more scale dependent bias at large scales. 
At fixed number density ( increases with increasing bias and is more rapidly 
increasing for rarer objects. At fixed bias ( is larger the rarer the object. Our 
model is not sophisticated enough to expect good agreement with large N-body 
simulations, however comparing to the work of (3) we find good qualitative 
agreement in the scale-dependence of the bias for 0.01 < k/(h Mpc~ ) < 0.1. 

We note that the 1- and 2-halo decomposition leads us to a new parame- 
terization of the scale-dependent bias. In the limit where halo profiles and 
scale-dependent halo bias can be neglected the most natural description of 



5 In a more complex/realistic model the 2-halo term involves the non- linear power 
spectrum and non-linear bias of the tracers, including halo exclusion effects. Mode 
coupling thus appears to reduce the baryon signal even at the 2-halo level. 
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Fig. 2. The factor £ that governs the strength of the scale dependent part of the 
galaxy bias. 

the galaxy spectrum is 



(26) 



which has two free parameters, 6 and k\. We expect this will describe the 
largest part of the scale-dependent bias. Non-linear bias, halo exclusion and 
profiles will show up as smaller corrections to this formula, such as a scale- 
dependence in b. It is difficult to compare the scale dependence in this frame- 
work to other treatments of scale dependent bias (e.g. (11)) where the galaxy 
density contrast is expanded in moments of the matter density contrast {pi, 
62, etc.), because the matter density contrast itself has both 1-halo and 2- 
halo contributions, and furthermore we have not extended our analysis to the 
bispectrum or higher order. 



4 Results redshift space 



Keeping to our philosophy of examining qualitative behavior in a simple 
model, we can extend these results to redshift space. The 2-point function 
in redshift space differs from that in real space due to two effects (12). The 
first, effective primarily on very large scales, accounts for the fact that dark 
matter and the galaxies that trace it have a tendency to flow toward over- 
densities as the structure in the universe is being assembled, enhancing the 
fluctuations in redshift space (9). The second comes into play inside virial- 
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ized structures, where random motions within the halo reduce fluctuations in 
redshift space. 



These corrections impact the 1-halo and 2-halo terms. The inflow effect pri- 
marily impacts the 2-halo term while virial motions primarily affect small 
scales which are dominated by the 1-halo term (13). The boost in the ob- 
served density contrast, 8k, due to instreaming is given 6 by (1 + ip 2 ) where 
p = r-k and f ~ f2^ 6 (9). The small scale suppression we take to be Gaussian. 
In general, when extending the model to galaxies tracing the dark matter, one 
should distinguish between central and satellite galaxies, since the latter have 
much larger virial motions and will therefore suffer more distortion in redshift 
space. We approximate this by taking a 2 cen ~ and <r 2 sat = GM/2r vir . Con- 
verting from velocity to distance we have for an M* halo a* 



Defining y s {v,k) = y(u, k) e -( fcCT «M) 2 / 2 the 1- and 2-halo terms are then given 

by (13) 
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(30) 



for the galaxies. As discussed in (13), in the 2-halo term the effect of peculiar 
velocities, going as fp 2 , is governed by the mass rather than the galaxy density 
field, requiring the addition of a separate integral over v. Conceptually, this 
term is added to account for extra clustering in redshift space induced by 
the bulk peculiar flow of the galaxies in one halo under the influence of the 
dark matter in other halos. For some purposes it is useful to average over 



6 It has been argued in Ref. (14) that the form 1 + fp 2 is not highly accurate on 
the scales relevant to observations and higher order corrections apply. Since it is 
our intent to gain qualitative understanding rather than quantitative accuracy we 
shall use the simplest form: 1 + fp 2 . Deviations from this will be yet another source 
of scale-dependence, but numerical simulations suggest it is small. 
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orientations of the galaxy separations (i.e. integrate over jj) but in the case of 
studying baryon oscillations, doing so throws away valuable information. 



As before we note that y(v,k) 1 and exp[— (ka v fi) 2 /2] ps 1 for k <C k* 
so the effect of redshift space distortions is primarily to enhance the 2-halo 
term - this makes the power spectrum "more linear" in redshift space than real 
space. However the second of our approximations, exp[— (ka v fi) 2 /2] ps 1, is not 
as good as the first, y(v, k) & 1, so there is enhanced /c-dependence from the 
individual terms. For the interesting range of n, k+a* ~ 1 so the exponential 
can only be neglected when /t 2 z/ 2 /( n + 3 ) <c 1 for all values of v that significantly 
contribute to the integral; i.e. near the peak of the integrand. For example, we 
see scale dependence at smaller k in the 1-halo term in redshift space than in 
real space. At k = 1/2, the exponential term induces a 13-14% change in the 
1-halo terms along the line of sight for both the dark matter and a moderately 
biased sample of galaxies, leading to a percent level correction in the ratio of 
power spectra. The error decreases rapidly as decreases. The importance 
of the exponential factor depends somewhat on the HOD parameters. The 
correction to the galaxy 1-halo term is larger as M min increases, but is smaller 
as M sat increases, due to the decreasing number of satellite-satellite pairs. 

For completeness we write the scale-dependent bias in redshift space in the 
approximation that y s (v, k) ~ 1. 



where we have defined E g = 1 + (a g f//3 g ){i 2 and S dm = 1 + f/i 2 to simplify the 
equations. 



5 Conclusions 

Models of structure formation where f2b ^ predict a series of features 
in the linear theory matter power spectrum, akin to the acoustic peaks seen 
in the angular power spectrum of the cosmic microwave background. These 
peaks provide a calibrated standard ruler, and a new route to constraining 
the expansion history of the universe. In order to realize the potential of this 
new method, we need to understand the conversion from what we measure - 
the non-linear galaxy power spectrum in redshift space - to what the theory 
unambiguously provides - the linear theory matter power spectrum in real 
space. The ability of N-body simulations to calibrate this mapping is improving 




(32) 



(31) 
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rapidly, but the complexity of the simulations can often mask the essential 
physics. In this paper we have tried to investigate the issues using a simplified 
model which can give qualitative insights into the processes involved. 

In our toy model we find that the distribution of galaxies within halos and 
the complexities of scale-dependent halo bias are sub-dominant contributions 
to the scale-dependence of galaxy bias. The dominant effect is the relative 
shifts of the 1- and 2-halo terms of the galaxies compared to the matter. The 
amplitude of the scale dependent bias on very large scales is parameterized 
by a quantity, (, which depends on the galaxy HOD. For our two parameter 
HOD we find ( increases with increasing bias at fixed number density and is 
more rapidly increasing for rarer objects. At fixed bias, ( is larger the rarer 
the object. 

The 1- and 2-halo decomposition leads us to a new parameterization of the 
scale-dependent bias. In the limit where halo profiles and scale-dependent halo 
bias can be neglected the most natural description of the galaxy spectrum is 



which has two free parameters, b and k\. This is very close to the phenomeno- 
logically motivated form proposed by (15). The extra /c-dependence these au- 
thors allowed in their multiplicative and additive terms can be understood here 
as the effect of non-linear power, non-linear bias and halo exclusion and halo 
profiles. The corrections appear first in the 2-halo term and then at smaller 
scales in the 1-halo term. Our results also suggest that on very large scales, 
the bias in configuration space has relatively little scale dependence because 
the effects of the 1-halo term are strictly limited to scales smaller than the 
virial radius of the largest halo. 

We would like to thank D. Eisenstein and R. Sheth for conversations. The 
simulations referred to in this work were performed on the IBM-SP at NERSC. 
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